Reinforcement learning reward function in unmanned aerial vehicle control tasks
Authors
Abstract
This paper presents a new reward function that can be used for deep reinforcement learning in unmanned aerial vehicle (UAV) control and navigation problems. The reward function is based on the construction and estimation of the time of simplified trajectories to the target, which are third-order Bezier curves. The reward function can be applied unchanged to solve problems in both two-dimensional and three-dimensional virtual environments. The effectiveness of the reward function was tested in a newly developed virtual environment, namely, a simplified environment describing the dynamics of UAV flight, taking into account the forces of thrust, inertia, gravity, and aerodynamic drag. In this formulation, three tasks of UAV control and navigation were successfully solved: UAV flight to a given point in space, avoidance of interception by another UAV, and organization of interception of one UAV by another. The three most relevant modern deep reinforcement learning algorithms, Soft Actor-Critic, Deep Deterministic Policy Gradient, and Twin Delayed Deep Deterministic Policy Gradient, were used. All three algorithms performed well, indicating the effectiveness of the selected reward function.
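To make the idea of a trajectory-time estimate concrete, the following is a minimal sketch, not the paper's implementation. It builds a third-order Bezier curve from the UAV to the target, approximates its length, and divides by a speed to obtain a time estimate. The control-point layout, the constant `cruise_speed`, the `lead` parameter, and the `progress_reward` shaping are all assumptions made for illustration; the abstract does not specify them. Because points are plain tuples, the same code runs unchanged in two or three dimensions.

```python
import math

def cubic_bezier(p0, p1, p2, p3, t):
    """Point on a cubic (third-order) Bezier curve at parameter t in [0, 1]."""
    s = 1.0 - t
    return tuple(
        s**3 * a + 3 * s**2 * t * b + 3 * s * t**2 * c + t**3 * d
        for a, b, c, d in zip(p0, p1, p2, p3)
    )

def bezier_length(p0, p1, p2, p3, samples=200):
    """Approximate arc length by summing straight segments between samples."""
    points = [cubic_bezier(p0, p1, p2, p3, i / samples) for i in range(samples + 1)]
    return sum(math.dist(a, b) for a, b in zip(points, points[1:]))

def estimated_time_to_target(position, velocity, target, cruise_speed, lead=1.0):
    """Hypothetical time estimate: curve length over an assumed constant speed."""
    # Assumed control points: the curve leaves along the current velocity
    # and ends at the target (the last two control points coincide).
    p0 = tuple(position)
    p1 = tuple(p + lead * v for p, v in zip(position, velocity))
    p3 = tuple(target)
    p2 = p3
    return bezier_length(p0, p1, p2, p3) / cruise_speed

def progress_reward(previous_time, current_time):
    """Assumed shaping: positive when the estimated time to target decreases."""
    return previous_time - current_time
```

For example, `estimated_time_to_target((0.0, 0.0), (1.0, 0.0), (10.0, 0.0), cruise_speed=2.0)` describes a UAV already heading straight at a target 10 units away, so the curve degenerates to a straight segment and the estimate is about 5 time units.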
Similar resources
Learning Unmanned Aerial Vehicle Control for Autonomous Target Following
While deep reinforcement learning (RL) methods have achieved unprecedented successes in a range of challenging problems, their applicability has been mainly limited to simulation or game domains due to the high sample complexity of the trial-and-error learning process. However, real-world robotic applications often need a data-efficient learning process with safety-critical constraints. In this...
Unmanned Aerial Vehicle Images
The main aim of this chapter is to give to the reader a complete overview about the general context in which the thesis is positioned. In a second part, the problems faced in the following chapters are introduced. Finally, we describe the proposed solutions and the thesis structure and organization. Chapter
Designing unmanned aerial vehicle based on neuro-fuzzy systems
In this thesis, neuro-fuzzy control is applied to a remotely piloted aerial vehicle (UAV). In the first proposed method, the neuro-fuzzy controller is trained off-line on a data set from a PID controller; in the second method, an on-line neuro-fuzzy controller based on system identification by an RBF neural network is proposed. The application of this controller to the UAV is then examined, and a comparison between the conventional controllers...
Autonomous Landing Unmanned Aerial Vehicle
This thesis presents the system architecture for landing an Unmanned Aerial Vehicle (UAV) from a hovering position without the intervention of a human operator. Through the use of feedback information from a height sensor, the UAV is commanded to perform controlled descent with the desired landing parameters by implementation of the flight control laws. The plant model of the system was determi...
Fuzzy Adaptive Control of Unmanned Aerial Vehicle for Carrying Time-Varying Cargo on Predefined Path
At present, the use of unmanned aerial vehicles (UAVs) has been increased dramatically. The reasons for this development are cheapness, smallness, simplicity, and diversity of missions. The simplicity of guidance and control of multi-rotor drones is that they are equipped with an autopilot system. This system is responsible for flying control. UAVs do not have a high weight and often have three...
Journal
Journal title: Journal of physics
Year: 2022
ISSN: 0022-3700, 1747-3721, 0368-3508, 1747-3713
DOI: https://doi.org/10.1088/1742-6596/2308/1/012004